Automated Medical Citation Records Creation for Web-Based On-Line Journals
Abstract
With the rapid expansion of the Internet and Web technologies, the number of on-line medical journals is increasing. On-line journals pose new challenges in automated document analysis and content extraction, creation of database citation records, data mining, and other document-related applications. New techniques are needed to capture, classify, analyze, extract, modify, and reformat Web-based document information for computer storage, access, and processing. At the National Library of Medicine (NLM) we are developing an automated system, temporarily code-named WebMARS (Web-Based Medical Article Record System), to create citation records for the MEDLINE® database. The system downloads and classifies Web document articles, parses and labels the article contents, extracts and reformats the citation information from each article, presents the entire citation to operators for reconciling (validation), and uploads the citation records to the MEDLINE database.
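The parse, label, extract, and reconcile stages named in the abstract can be illustrated with a minimal Python sketch. It is not the WebMARS implementation: the `Citation` record, the `ArticleLabeler` class, the `extract_citation` and `missing_fields` helpers, and the assumed page layout (title in `h1`, authors in `span class="author"`, abstract in `p class="abstract"`) are all hypothetical, chosen only to show how rule-based labeling of HTML might feed an operator review step.

```python
from dataclasses import dataclass, field
from html.parser import HTMLParser


@dataclass
class Citation:
    """Hypothetical citation record; field names are illustrative only."""
    title: str = ""
    authors: list = field(default_factory=list)
    abstract: str = ""


class ArticleLabeler(HTMLParser):
    """Labels text by its enclosing tag and class (an assumed page layout)."""

    def __init__(self):
        super().__init__()
        self.citation = Citation()
        self._label = None  # label of the element currently being read

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "h1":
            self._label = "title"
        elif tag == "span" and css_class == "author":
            self._label = "author"
        elif tag == "p" and css_class == "abstract":
            self._label = "abstract"

    def handle_endtag(self, tag):
        # Simplification: any closing tag ends the current label,
        # so markup nested inside a labeled element is not handled.
        self._label = None

    def handle_data(self, data):
        text = " ".join(data.split())  # normalize whitespace
        if not text or self._label is None:
            return
        if self._label == "title":
            self.citation.title = (self.citation.title + " " + text).strip()
        elif self._label == "author":
            self.citation.authors.append(text)
        elif self._label == "abstract":
            self.citation.abstract = (self.citation.abstract + " " + text).strip()


def extract_citation(html: str) -> Citation:
    """Parse one article page and return the labeled citation fields."""
    labeler = ArticleLabeler()
    labeler.feed(html)
    labeler.close()
    return labeler.citation


def missing_fields(citation: Citation) -> list:
    """List empty fields so an operator can reconcile them before upload."""
    return [name for name in ("title", "authors", "abstract")
            if not getattr(citation, name)]
```

In this sketch, a record whose `missing_fields` result is non-empty would be routed to an operator for reconciling instead of being uploaded directly. A real system would need publisher-specific rules, since journal pages rarely share one layout.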
Similar resources
Automated Document Labeling
An increasing number of publishers are using the Internet and the World Wide Web to provide their subscribers with access to online journals. New techniques are needed to capture, classify, analyze, extract, modify, and reformat Web-based document information for computer storage, access, and processing. An R&D division of the National Library of Medicine (NLM) is developing an automated system...
Style-independent document labeling: design and performance evaluation
The Medical Article Records System or MARS has been developed at the U.S. National Library of Medicine (NLM) for automated data entry of bibliographical information from medical journals into MEDLINE®, the premier bibliographic citation database at NLM. Currently, a rule-based algorithm (called ZoneCzar) is used for labeling important bibliographical fields (title, author, affiliation, and abst...
Automated Cleanup Processing for Extracting Bibliographic Data from Biomedical Online Journals
An R&D division of the National Library of Medicine (NLM) has developed the Web-based Medical Article Records System (WebMARS) to create citations from online biomedical journals. This paper presents one important part of this system, the automated cleanup module that extracts bibliographic information from HTML-formatted text based on a rule-based approach. A learning scheme comparing the outp...
A Bibliometric Analysis of Toxicology Publications of Iran and Turkey in ISI Web of Science
Background: Web of Science (WoS) is an online academic citation index provided by Thomson Reuters which supplies valuable bibliometric information for comparing impact of specific author, organization, or country in science production. The aim of this study was to compare toxicology publications of Iran and Turkey indexed in WoS from bibliometric point of view. Methods: The WoS database was ...
Radiology, nuclear medicine, and medical imaging: a bibliometric study in Iran
Introduction: Nowadays, science mapping is considered an excellent technique for decision-makers to find solutions for problems in research planning and development. In this work, we aimed to depict a science map of “radiology, nuclear medicine, and medical imaging” in Iran. Methods: All publications indexed in Thomson Reuters Web of Science database in the fields mentioned above with at least...
Publication date: 2001